Generalised Discount Functions applied to a Monte-Carlo AIμ Implementation

Authors

Sean Lamont

John Aslanides

Jan Leike

Marcus Hutter

Abstract

In recent years, work has been done to develop the theory of General Reinforcement Learning (GRL). However, there are few examples demonstrating the known results regarding generalised discounting. We have added to the GRL simulation platform AIXIjs the functionality to assign an agent arbitrary discount functions, and an environment which can be used to determine the effect of discounting on an agent’s policy. Using this, we investigate how geometric, hyperbolic and power discounting affect an informed agent in a simple MDP. We experimentally reproduce a number of theoretical results, and discuss some related subtleties. It was found that the agent’s behaviour followed what is expected theoretically, assuming appropriate parameters were chosen for the Monte-Carlo Tree Search (MCTS) planning algorithm.

Keywords: Reinforcement Learning, Discount Function, Time Consistency, Monte Carlo
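To make the three discount families named in the abstract concrete, the following is a minimal sketch in Python. It is not taken from the paper or from AIXIjs (which is a JavaScript platform); the function names, the `(t, k)` calling convention, and the parameter names `gamma`, `kappa` and `beta` are assumptions chosen for illustration, using commonly stated forms of geometric, hyperbolic and power discounting.

```python
from typing import Callable, Sequence

# Hypothetical convention: discount(t, k) is the weight an agent standing at
# time t gives to a reward received at time k >= t (times start at 1).
Discount = Callable[[int, int], float]


def geometric(gamma: float) -> Discount:
    """Geometric discounting: weight gamma ** (k - t)."""
    return lambda t, k: gamma ** (k - t)


def hyperbolic(kappa: float, beta: float = 1.0) -> Discount:
    """Hyperbolic discounting: weight 1 / (1 + kappa * (k - t)) ** beta."""
    return lambda t, k: 1.0 / (1.0 + kappa * (k - t)) ** beta


def power(beta: float) -> Discount:
    """Power discounting: weight k ** (-beta), depending on absolute time k only."""
    return lambda t, k: float(k) ** (-beta)


def discounted_return(rewards: Sequence[float], discount: Discount, t: int = 1) -> float:
    """Sum of rewards received at times t, t+1, ... weighted by the discount."""
    return sum(discount(t, t + i) * r for i, r in enumerate(rewards))


if __name__ == "__main__":
    rewards = [1.0, 1.0, 1.0]
    for name, d in [("geometric", geometric(0.5)),
                    ("hyperbolic", hyperbolic(1.0)),
                    ("power", power(2.0))]:
        print(name, discounted_return(rewards, d))
```

Under these assumed forms, the geometric and hyperbolic weights depend only on the delay `k - t`, whereas the power weight depends on the absolute time `k`.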

Similar resources

Generalised Discount Functions

In recent years, work has been done to develop the theory of General Reinforcement Learning (GRL). However, there are no examples demonstrating the known results regarding generalised discounting. We have added to the GRL simulation platform (AIXIjs) the functionality to assign an agent arbitrary discount functions, and an environment which can be used to determine the effect of discounting on ...

Uncertainties due to Fuel Heating Value and Burner Efficiency on Performance Functions of Turbofan Engines Using Monte Carlo Simulation

In this paper, the impacts of the uncertainty of fuel heating value as well as the burner efficiency on performance functions of a turbofan engine are studied. The mean value and variance curves for thrust, thrust specific fuel consumption as well as propulsive, thermal and overall efficiencies are drawn and analyzed, considering the aforementioned uncertainties based on various Mach numbers at...

Implementing The Generalised Hybrid Monte-Carlo Algorithm

UKQCD’s dynamical fermion project uses the Generalised Hybrid Monte Carlo (GHMC) algorithm to generate QCD gauge configurations for a non-perturbatively O(a) improved Wilson action with two degenerate sea-quark flavours. We describe our implementation of the algorithm on the Cray-T3E, concentrating on issues arising from code verification and performance optimisation, such as parameter tuning, ...

Design and Simulation of Photoneutron Source by MCNPX Monte Carlo Code for Boron Neutron Capture Therapy

Introduction: Electron linear accelerator (LINAC) can be used for neutron production in Boron Neutron Capture Therapy (BNCT). BNCT is an external radiotherapeutic method for the treatment of some cancers. In this study, Varian 2300 C/D LINAC was simulated as an electron accelerator-based photoneutron source to provide a suitable neutron flux for BNCT. Materials and Methods: Photoneutron sources w...

Secondary Particles Produced by Hadron Therapy

Introduction: Use of hadron therapy as an advanced radiotherapy technique is increasing. In this method, secondary particles are produced through primary beam interactions with the beam-transport system and the patient’s body. In this study, Monte Carlo simulations were employed to determine the dose of produced secondary particles, particularly neutrons during treatment. Materials and Methods: I...

Publication date: 2017
